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(57) Abstract 



There is disclosed a method for examining polymorphisms in a nucleotide sequence of a first sample of nucleic acid com- 
prising the steps of: (a) forming a population of nucleic acid fragments bearing the said nucleotide sequence; (b) combining the 
population of nucleic acid fragments with a population of a synthetic nucleotide construct which construct is capable of forming 
duplexes with the nucleic acid fragments but which contains one or more deliberate nucleotide substitution(s) and/or deletion(s) 
and/or insertions(s) which is/ are (i) opposite to a known variable nucleotide or sequence of nucleotides within the sequence un- 
der examination and/or (ii) contiguous with a nucleotide which is opposite to a known variable nucleotide or sequence of nucleo- 
tides within the sequence under examination; (c) permitting duplex formation within the combined populations; and (d) separat- 
ing the duplexes formed in step (c). 
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NUCLEIC ACID ANALYSIS 

This invention relates to a method for examining 
single or multiple nucleotide polymorphisms (or 
mutations) in a nucleotide sequence In particular, but 
5 not exclusively, a nucleotide sequence contained within 
a sample of mammalian genomic DNA sequence. 

Variations between individuals in the DNA sequence 
of genes coding for functional proteins account for 
many so-called "genetic diseases". For example, a 

10 single base pair substitution in the beta-globin gene 
leads to the genetic disease known as beta- 
thalassaemia. Similarly, these variations, 
particularly in the genes coding for functional Human 
Leucocyte antigens (HLA), explain many of the rejection 

15 problems encountered in human allogeneic transplant 
operations. Considerable effort is required to "HLA 
match" the donor to the recipient (patient). 

Recently, a technique known as PCR (polymerase 
chain reaction) fingerprinting has been developed for 

20 human HLA-DR/Dw allotype matching (Bidwell JL and Hui 
KM 1990). As discussed in this paper, this application 
of the polymerase chain reaction technique permits the 
rapid matching between individuals of HLA-DR-Dw 
specifies. The technique exploits the formation of 

25 mismatched DNA hybrids (heteroduplexes) which are 

formed at the end of each PCR cycle between coampllfled 
products of the second exons of expressed DRB genes 
(DRBl, DRB3, DRB4, and DRB5) and DRB pseudogenes (DRB2, 
DRB6, DRB7) (Also see Clay TM, Bidwell JL, Howard MR 

30 and Bradley BA 1991 and Wood NAP, Clay TM and Bidwell 
JL 1991). During the primer-annealing stage of each 
cycle of the PCR, a proportion of coding strands of 
each DRB locus may hybridise to noncoding strands of a 
different DRB locus and vice- versa. Resulting 

35 molecules contain double stranded regions where the 
nucleotide sequence is complementary, but regions of 
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nucleotide sequence mismatch form single stranded loops 
along the length of the mplecule. The number, size and 
position of the single stranded loops vary for each DR 
haplotype and each combination of haplotypes, in both 
5 DR-Dw homozygous and in heterozygous individuals. 

Because the molecular conformations of heteroduplexes 
vary, they may be separated from each other, and from 
the corresponding homoduplexes, by, for example, gel 
retardation analysis in nondenaturing polyacrylamide 
10 gels* In practice, haplotype- specific gel banding 
patterns (termed PCR fingerprints) are observed. 
Matching patterns of PCR fingerprints between two 
individuals indicate DR-Dw identity, whereas mismatched 
patterns indicate DR-Dw disparity: visual examination 
15 of ethidium bromide-stained polyacrylamide minigels 
thus constitutes^ a very simple test for DR-Dw 
compatibility • 

PCR fingerprints observed in DR-Dw heterozygous 
individuals may be exactly reproduced by mixing DNAs 
20 from two corresponding DR-Dw homozygous typing cells 
(HTCs) prior to PCR (Bidwell and Hui 1990), However, 
in DR-Dw heterozygotes, additional bands may be 
observed which are not present in either of the 
patterns from the HTCs. These additional bands arise 
25 from heteroduplex formation in trans, that is, between 
PCR products from the two different haplotypes. This 
phenomenon is exploited in the DMA crossmatch test 
(Wood et al 1991, Clay et al 1991). Here, genomic DNAs 
from two different individuals are coamplified in the 
30 PCR. The resulting PCR fingerprint is compared with 
those obtained by separate PCRs for each individual. 
Matching PCR fingerprints from all three products 
indicates DR-Dw identity, whereas mismatch is indicated 
by different patterns and/or the appearance of new 
35 heteroduplexes formed in trans. This DNA crossmatching 
test may be successfully exploited to perform limited 



1 



wo 93/19201 



PCT/GB93/00616 



-3- 

* DR-Dw "typing": for example, DNA crossmatches with DR7- 

DQ9 and DR9-DQ9 HTCs may be used to discriminate 
between these alleles where conventional DNA-RFLP 
typing fails. 

5 Theoretically, the number of conformationally 

distinct heteroduplexes generated from a DR-Dw 
homozygous or heterozygous individual with a total of n 
different alleles at coamplifiable loci, is - n 
(Bidwell and Hui, 1990). In practice, particularly in 
10 DR-Dw heterozygotes, not all of the predicted 

conformationally distinct heteroduplexes are observed. 
A likely reason for this incomplete resolution is that 
a proportion of heteroduplexes comigrate and are thus 
superimposed in gel retardation analysis. In addition, 

15 some heteroduplexes may form less avidly or be less 
stable than others. In general, however, there are 
more than sufficient discriminatory heteroduplexes to 
permit matching, but in a few cases (for example, 
discrimination between certain DR4 alleles in DR4- 

20 homozygous Individuals), PCR "spiking" may be 

necessary. PCR spiking Is the inclusion or "spiking" 
of PCRs with DMA from a DR-Dw disparate HTC, for 
example a DR8 HTC. Spiking PCRs in this manner with 
third party DNA generates new heteroduplexes which 

25 enhance discrimination between otherwise similar PCR 
fingerprints (Clay et al 1991 and Bidwell JL 1992). 
DNA from a homozygous DR8 cell line has been 
successfully used as a "spiking" reagent since it 
generates a defined but limited number of new 

30 heteroduplexes. 

Although PCR fingerprinting works well for HLA- 
DR/Dw matching, extension of the principle to matching 
HLA-DP and DQ (other loci in the human HLA region 
coding for alloantigenic functional proteins) is not 

35 directly achievable because there are insufficient 
ampliflable loci. Moreover, the PCR fingerprinting 
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technique is unsuitable as a generalised method for 
eKamining single or low niimbers of dispersed nucleotide 
polymorphisms, such as those which result in genetic 
diseases. 

5 According to a first aspect of the present 

invention, there is provided a method for examining 
polymorphisms in a nucleotide sequence of a first 
sample of nucleic acid comprising the steps of: 

(a) forming a population of nucleic acid 
10 fragments bearing the said nucleotide sequence; 

(b) combining the population of nucleic acid 
fragments .with a population of a synthetic nucleotide 
construct which construct is capable of forming 
duplexes with the nucleic acid fragments, the sequence 

15 of the construct being such that duplexes of different 
molecular conformation are formed between the construct 
and the nucleic acid fragments dependent upon the 
presence of a polymorphism at a known variable 
nucleotide or sequence of nucleotides within the 

20 sequence of the sample under examination; 

(c) permitting duplex formation within the 
combined populations; and 

(d) separating the duplexes formed in step (c). 
The synthetic construct may contain one or more 

25 deliberate nucleotide substitution( s ) and/or 

deletion(s) and/or insertion(s) which is/are (i) 
opposite to a known variable nucleotide or sequence of 
nucleotides within the sequence under examination 
and/or (ii) contiguous with a nucleotide which is 

30 opposite to a known variable nucleotide or sequence of 
nucleotides within the sequence under examination. 

By the term "opposite", we mean that, in the 
resultant duplex between the nucleic acid fragment 
strand and the synthetic construct strand, the 

35 deliberate nucleotide substitution( s ) and/or 
deletion(s) and/or insertion(s) is/are in the 
position(s) in the synthetic construct strand 
corresponding to the variable nucleotide(s) under 
examination. The "contiguous" portion of the synthetic 

40 construct strand may be a single nucleotide in contact 
with the nucleotide opposite the variable nucleotide(s) 
or may be a nucleotide sequence, normally no more than 
20 bases in length, in which one of the end nucleotides 
is in contact with the nucleotide opposite the variable 

45 nucleotide( s ) . 

For the avoidance of doubt, it should be 
understood that, in the present invention, the 
nucleotide substitution( s ) and/or deletion(s) and/or 
insertion(s) contained in the sequence of the construct 

50 are made either (i) relative to the wild type of the 
nucleotide sequence under examination so that, in the 
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duplexes formed between the nucleic acid fragments and 
the construct, there Is created a deliberate or 
controlled mismatch when the nucleotide sequence 
contains a mutation from the wild type or (ii) relative 
5 to a mutant of the nucleotide sequence under 

examination so that, in the duplexes formed between the 
nucleic acid fragments and the construct, there is 
created a deliberate or controlled mismatch when the 
nucleotide sequence is either wild type or another 

10 (different) mutant. 

Preferably, the nucleotide sequence of the 
synthetic construct at least contains deliberate 
nucleotide substitution( s ) and/or deletion(s,) and/or 
insertion(s) contiguous with the nucleotide which is 

15 opposite the known variable nucleotide or sequence of 
nucleotides. It may also contain deliberate nucleotide 
substitution(s) and/or deletion(s) and/or insertion(s) 
opposite the known vardable nucleotide or sequence of 
nucleotides. 

20 Preferably, the synthetic construct includes 

nucleotide substitution( s ) and deletion(s) both 
opposite and contiguous with the mutation. In 
practice, it has been found that a sequence of between 
one and ten substitutions/deletions adjacent the 

25 position of the polymorphism in the nucleotide sequence 
under investigation is satisfactory for the purposes of 
this invention. 

In order simultaneously to identify more than one 
polymorphism in a given nucleotide sequence in a single 

30 or reduced number of experiments, the sequence of the 
construct is such that duplexes of different molecular 
conformation are formed between the construct and the 
nucleic acid fragments dependent upon the presence or 
absence of polymorphisms at two or more known variable 

35 nucleotides and/or sequences of nucleotides within the 
sequence of the sample under examination; it is 
therefore preferred that the synthetic construct 
includes deliberate nucleotide substitution(s) and/or 
deletion(s) and/or insertion(s) corresponding to at 

40 least two known polymorphisms in the nucleotide 

sequence, and preferably corresponding to all of the 
known polymorphisms in the nucleotide sequence. 

The extent of deliberate nucleotide 
substitution(s) and/or deletion(s) and/or insertion! s) 

45 should not be so great that duplex formation between 
the nucleic acid fragments and the construct is not 
possible. 

" The construct may also include a deliberate 
deletion or a series of deletions, at a position away 



Wa 93/19201 



PCT/GB93/006I6 



-6- 

from polymorphic sites in the nucleotide sequence under 
investigation (this is in addition to any deletion 
adjacent to a mutation site) ; this has the effect of 
permitting better electrophoretic separation of 
5 heteroduplexes from homoduplexes. 

The results of the separation conducted in step 
(d) of the method of this invention may be compared 
with the results of one or more similar separations 
conducted on duplexes formed using steps (b) to (d) 

10 above with the same synthetic nucleotide construct and 
a population of nucleic acid fragments bearing a 
nucleotide sequence corresponding to the said 
nucleotide sequence under examination, but taken from 
one or more different samples of DNA or RNA. This 

15 enables polymorphisms in any of the nucleic acid 
samples to be detected and compared. 

Typically, the nucleic acid under examination will 
be DNA, normally mammalian (such as human) genomic DNA. 
In its broadest aspect, however, the nucleic acid may 

20 be DNA or RNA from any source, genomic or otherwise, 
for example, a virus, bacterium, plant or animal 
source. 

The synthetic nucleotide construct will normally 
be a synthetic DNA, because this will form the most 

25 stable duplexes with DNA fragments containing the 
nucleotide sequence under investigation. 

Where the nucleic acid under examination Is 
mammalian genomic DNA, a sample of the DNA is obtained 
from an individual or other object whose genotype for a 

30 specific characteristic it is wished to study. (The 
term "individual" is intended to include a foetus.) 
DNA can be extracted from all nucleated cells. 
Typically, the DNA is obtained from peripheral blood 
cells for convenience. Foetal DNA can be obtained from 

35 placental cells or amniotic fluid. Other sources of 
DNA Include hair follicles, mummified bodies, etc. 



wo 93/19201 



PCr/GB93/00616 



-7- 

The DNA may be isolated by any appropriate method^ 
for example by the rapid salting out method described 
by Miller et al (Miller, S., Dykes, D. and Polesky, H. 
(1988) "A simple salting out procedure for extracting 
5 DNA from human nucleated cells"; NucL Acids Res. 16:1215) • 
Alternatively, the DNA may be isolated as cDNA from 
mRNA by reverse transcription. 

The nucleotide sequence contains the polymorphism 
under examination; thus, the sequence will include at 

10 least that part of the relevant gene containing the 

polymorphism. This will normally be a part of the gene 
ultimately coding for a region of the functional 
protein (i.e. exon), but may alternatively or 
additionally be an intervening part of the gene which 

15 is not responsible for coding the functional protein 
(e.g. an intron) but which nevertheless affects 
transcription and translation or is known to be in 
genetic linkage with specific polymorphisms in coding 
sequences of the gene. Since two or more loci coding 

20 for different forms of the same protein - or 

representing pseudogenes - may be present (as for the 
DRB genes of the HLA-DR loci) so there may be other 
forms of the nucleotide sequence under investigation 
and t:hese ot:her forms of the nucleotide sequence may be 

25 formed into a population of nucleic acid fragments 

which can participate in the duplex formation reaction 
of step (c) - for example see Bidwell, JL and Hui, KM 
(1990). 

The population of nucleic acid fragments bearing 
30 the nucleotide sequence under examination is formed 

from the first sample of nucleic acid by a method which 
preserves the relevant sequence information. It is 
presently preferred that the method used is one which 
directly amplifies the nucleotide sequence in the 
35 sample of DNA or RNA (which may for example be isolated - 
directly from the organism under investigation or may 
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be cDNA derived from mRNA by reverse transcription) to 
result in a population of nucleic acid fragments 
bearing the nucleotide sequence. Depending upon the 
method of amplification employed, the resulting nucleic 
5 acid fragments may be RNA or DNA. However, they are 
preferably DNA, formed from a DNA sample using an 
enzymatic amplification technique such as the 
polymerase chain reaction (PCR). The technique of 
"run-off" transcription (Green, MR, Maniatis, T, 
10 Melton, DA (1963) Human beta-globin pre-mRNA 

synthesised in vitro is accurately spliced in Xenopus 
oocyte nuclei. Cell 32:681-694) may be used to prepare 
a population of RNA fragments from a DNA sample. 

The preferred method for forming from a DNA source 
15 a population of DNA fragments bearing the nucleotide 
sequence is the polymerase chain reaction (PCR). In 
this procedure, -Cwo oligonucleotide primers are added 
to the DNA source for annealing to complementary 
sequences at either end of the nucleotide sequence, a 
20 heat-stable DNA polymerase, such as Taq polymerase, 
dATP, dCTP, dGTP and dTTP and appropriate biological 
buffers and cof actors. The DNA is denatured, the 
oligonucleotide primers anneal to their complementary 
sequences with the 3' ends pointing towards each other 
25 and the DNA polymerase results in extension of the 
annealed primers and an^lification of the segment of 
DNA defined by the 5' ends of the primers. 

Where the synthetic construct is a DNA construct, 
the population of the synthetic DNA construct may be 
30 formed using the PCR, preferably employing the same 
primers as used for amplification of the nucleotide 
sequence under consideration although different primers 
could be used so long as the resulting amplified 
construct is capable of forming the necessary duplexes 
35 with the nucleic acid fragments. 

In the PCR, the cycle of DNA denaturation, primer 
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anneallng and synthesis of the DNA segment defined by 
the 5* ends of the primers is repeated as many times as 
is necessary to amplify the nucleotide sequence under 
consideration (or the construct) until sufficient is 
5 available for step. (c) of the present method. 

Amplification may proceed for from 20 to 40 cycles, for 
example from 25 to 35 cycles. Amplification of the 
nucleotide sequence under investigation and the 
construct may be conducted simultaneously in a mixed 

10 PGR or in separate PCRs. 

Any appropriate oligonucleotide primers may be 
employed. The primers should be suitable for 
amplification of specific locus under investigation. 
The primers may be labelled for facilitating the 

15 comparison between the results of the separation in 
step (ill) of ^he present method and the separation 
conducted on the duplexes formed in the second reaction 
on the second sample of DNA. The primers can be 
labelled with a directly detectable tag, for example a 

20 radionuclide such as ^^P or ^^S, a fluorescent compound 
such as fluorescein, an enzyme such as a horse radish 
peroxidase or alkaline phosphatase, with biotln or 
digoxigenln* The two primers may have the same or 
different labels. 

25 Before formation of a population of the construct 

(for example by using the PGR), the construct roust 
first be synthesised. Typically, the construct is 
synthesised from multiple synthetic oligonucleotides, 
for example. Once synthesised, it may be cloned in a 

30 suitable host, for instance an M13 phage. 

The amplified nucleic acid fragments and the 
amplified construct obtained from mixed PGR or separate 
PGRs are heated (mixed PGR) or mixed and heated 
(separate PCRs) to permit separation of the amplified 

35 product into single stranded nucleic acid which is then 
allowed to re- anneal under suitable annealing 
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conditlons to form the duplexes. Duplexes will be 
formed between the coding and non-coding strands 
present in the reaction vessel and, in particular, a 
proportion of the duplexes so formed will be mismatched 
5 hybrids between sense and anti- sense strands of 
partially homologous sequences ("heteroduplexes" 
formed either in cis or trans or in both cis and trans. 

The heteroduplexes formed are separated according 
to their- molecular conformation which affects their 

10 apparent, but not actual, molecular weight. This may 
be achieved by, for example, electrophoresis. The 
separation is typically effected on a gel which does 
not fully denature the nucleic acid, such as a non- 
denaturing polyacrylamide gel. Electrophoresis is 

15 conducted under conditions which effect a desired 
degree of resolution of the duplexes. A degree of 
resolution that separates duplexes that differ in 
"apparent size" - resulting from their different 
molecular conformations - by as little as about 10 bp 

20 is usually sufficient. The heteroduplexes can be 

"shifted" away from the homoduplexes in the separation 
by Introducing one or more deliberate deletions in the 
construct as described above; this has the effect of 
retarding the mobilities of all heteroduplexes within 

25 the gel. Size markers may also be run on the gel to 
permit estimation of the apparent size of duplexes. 

The distribution, i.e. the resolution pattern, of 
the heteroduplexes will be allele-specific. This 
resolution pattern or PGR fingerprint can next be 

30 visualised. Where a PCR primer has been labelled, this 
label may be revealed. A substrate carrying the 
separated labelled duplexes Is contacted with a reagent 
which detects the presence of the label* Where the PCR 
primers were not labelled, the substrate bearing the 

35 PCR fingerprint may be contacted with, for example, 
• ethldium bromide and the nucleic acid fragments 
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visuallsed under ultraviolet light; alternatively, the 
heteroduplexes may be visualised with silver staining. 

The relative molecular conformation of the 
duplexes is thus determined, in this embodiment, as a 
5 pattern of bands representing the duplexes which 

migrate different distances from the origin on the gel, 
dependent on their relative molecular conformation* 
This is compared with the relative molecular 
conformation of duplexes resulting from a similar 

10 exercise (using the same construct) conducted on a 

nucleotide sequence of a second sample of nucleic acid. 
In this way, correspondence between the nucleic acid 
samples may be obtained. It may be ascertained whether 
the relative molecular conformation of the duplexes 

15 obtained, and therefore whether the nucleotide 
sequences of the two samples are the same or not. 

The relative molecular conformation in respect of 
the second sample of the nucleic acid (typically DNA) 
may be obtained using the same conditions as are 

20 employed to obtain the relative molecular conformation 
in respect of the first nucleic acid sample. Typically 
the same primers are used. PGR amplification need not 
necessarily be for the same number of cycles or under 
identical reaction conditions, though. Similarly, 

25 separation of the resulting duplexes need not be 
carried out in an Identical fashion provided it is 
possible to assess the relative correspondence of the 
molecular conformation of the duplexes resulting from 
amplification of each sample. 

30 The second sample of nucleic acid may be analysed 

according to the present method simultaneously with or 
at a different time to analysis of the first nucleic 
cid sample. Indeed, a multiplicity of samples may be 
analysed. Typically the or each sample is a selected 

35 sample. Samples from selected Individuals can be 
analysed. The relative molecular conformation 
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determined for each sample may be held in a computer. 

A computer database may therefore be generated 
containing the relative molecular conformation 
distribution patterns for different samples. 
5 The PGR fingerprint using the method of the 

invention may be compared with another PGR fingerprint 
to determine whether the individuals^ whose DNA has 
been tested to obtain the two fingerprints, match in 
respect of the particular nucleotide sequence under 

10 investigation. The present method can therefore be 
applied to DNA samples from two or more Individuals. 
Alternatively, a PGR fingerprint may be compared with a 
standard or reference fingerprint previously obtained. 
Correspondence between fingerprints can therefore be 

1 5 determined . 

The present method can be used for example to 
determine whether a donor of a transplant or 
transfusion and a recipient or proposed recipient of 
the transplant or transfusion have matching allotypes. 

20 PGR fingerprints from the donor and the recipient or 

proposed recipient can be compared. The transplant may 
be a -tissue transplant such as a heart, lung, liver or 
kidney transplant or a bone marrow transplant. The 
transfusion may be a blood transfusion. Matching of 

25 living related or unrelated donors for allogeneic 
transplantation may therefore be achieved. 

In particular, the method can be used in matching 
individuals in respect of one or more of the functional 
proteins expressed by the HLA (human leucocyte antigen) 

30 class I and II genes, for example the Class II DPB, 

DQB, DQA and DRB genes which all exhibit polymorphism. 
This may be used for "typing" individuals and 
"matching" a patient to a donor in a transplant 
operation. Where the DPB gene is under investigation, 

35 it has been found effective to analyse the second exon 
of the gene which is the known polymorphic sequence of 
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the gene. A DNA construct for use in the method of the 
invention for analysis of the DPBl second exon should 
preferably have a nucleotide sequence substantially 
corresponding to the invariable regions of the gene and 
5 nucleotide substitutions at the knovm variable 

nucleotides, for instance at one or more of positions 
56, 58, 59, 60, 65, 66, 138, 141, 198, 199, 201, 205, 
227, 239, 260, 262, 285, 288, 290, 293 starting from 
the 3'- end of primer UG19 (a knomi primer used to 

10 prime DNA polymerisation of the second exon of DPBl). 
In addition, it is preferred that the construct 
contains at least one deletion when compared with the 
gene sequence under consideration, for instance a 5 
base pair deletion at position 235 starting from the 

15 3'- end of primer UG19 (SEQ ID NO: 6). This is in 
order more effectively to separate the heteroduplexes 
from the horooduplexes. 

Thus, whilst extension of the heteroduplex 
principle as described by Bidwell and Hui (1990), supra 

20 to HLA-DP matching is not directly possible because for 
DPBl -specific PGR primers there is generally only a 
single PGR product, nevertheless it is possible with 
the method of the present invention to generate PGR 
fingerprints for the DPBl locus by permitting 

25 heteroduplex formation between the population of DNA 
fragments bearing the nucleotide sequence under 
examination with a synthetic molecular construct (such 
as that described above) which contains at least one 
deliberate nucleotide substitution, deletion or 

30 insertion, designed to be opposite and/or contiguous 
with a hypervariable nucleotide within, for this 
example, polymorphic second exons of the target DPBl 
genes. These constructs may be used following PGR 
reactions in order to produce heteroduplexes specific 

35 for individual DPBl alleles, and thus to allow rapid 
matching. We have constructed and cloned a construct 
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( see Figure 1) specific for the DPBl locus which 
discriminates between the 21 specificities DPB1*0101 to 
DPB1*2001 (Fig 2). In practise, DPBl second exon 
sequences are amplified from genomic DNA in the normal 
5 manner. In a separate PGR and using the same primers, 
the synthetic "DPB construct", or universal 
heteroduplex generator (UHG), is selectively amplified 
from an M13mpl8 phage clone. Next, aliquots from the 
two separate reactions are mixed together and subjected 

10 to 2-3 further cycles of PGR, in order to allow 

heteroduplex formation. Finally, PGR fingerprints are 
examined in the normal way following polyacrylamide gel 
electrophoresis. Fig 3 shows the results obtained for 
a selection of DPBl HTCs. In DPBl heterozygotes, some 

15 naturally occurring heteroduplexes formed in trans 

might be expected, but in practice not all combinations 
of alleles product heteroduplexes without use of the 
method of the present invention employing a DPB 
construct (Fig 4a). However, conducting the method 

20 with an amplified DPB construct yields highly 

discriminatory PGR fingerprints of heteroduplexes (Fig 
4b), analogous to the DR-Dw system. Newly described 
alleles (for example, DPB1*2101-3601 ) may be catered 
for by alteration to one or more nucleotides within an 

25 existing construct* It is envisaged that this method 
will be particularly appropriate to the DQAl and DQBl 
loci, and in conjunction with group-specific PCR-SSP, 
to problematic areas in DRBl, for example DR4 subtype 
matching. 

30 The present method can also be used in determining 

whether an individual is susceptible to or has a 
disease associated with a particular nucleotide 
sequence which codes for a functional protein. Such 
diseases are reviewed in Immunol. Rev. 70, 1-218, 1983. 

35 In a method in accordance with the present 

invention for the diagnosis of a genetic disease, a 
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suitable synthetic construct (usually a DNA construct) 
is employed which is capable of forming duplexes with a 
nucleotide sequence of (a) a translated or non- 
translated exon region, or (b) an intron region which 
5 affects expression or mRNA slicing, the nucleotide 
sequence containing a mutation responsible for a 
genetic disease, the construct containing one or more 
deliberate nucleotide substitution(s) and/or 
deletion(s) and/or insertion(s) which is/are (i) 

10 opposite to a known variable nucleotide or sequence of 
nucleotides within the sequence under examination 
and/or ( li ) contiguous with a nucleotide which Is 
opposite to a known variable nucleotide or sequence of 
nucleotides within the sequence under examination. DNA 

15 is then Isolated from the individual under diagnosis, 
for example from whole blood obtained according to 
standard protocols or from dried blood spots or from 
any other suitable source of DNA. Populations of the 
chosen synthetic construct and the nucleotide sequence 

20 are formed, for example using the polymerase chain 

reaction and these populations are then combined under 
suitable conditions to permit duplex formation within 
the combined populations . The duplexes formed are then 
separated, for example using gel electrophoresis, and 

25 the duplexes visualised, for example by staining. The 
resulting profile is then compared with a standard 
profile representing the results obtained using the 
same synthetic construct either with a DNA sample known 
to contain the mutation under examination or a DNA 

30 sample known not to contain the mutation under 

examination. This enables the user of the method 
readily to identify whether the individual under 
diagnosis carries the mutation responsible for the 
genetic disease. 

35 The present Invention Is particularly suitable 

where the relevant gene is multiallelic but is not 
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duplicated within the genome and therefore DNA 
heteroduplex formation following the polymerase chain 
reaction (PGR) is possible only in 'trans' and 
therefore only in heterozygotes for a given gene. In 
5 accordance with the present invention, DNA 

heteroduplexes can be generated by hybridising PGR 
products with synthetic DNA molecules. These are PCR- 
amplif iable DNA sequences that mimic a DNA sequence but 
differ from it as a result of controlled nucleotide 
10 substitution(s), deletion(s) or insertion(s) at sites 
continuous to known mutation sites within the DNA. The 
construct and DNA sequences are amplified with the same 
PGR primers, and hybridised together post-PCR by 
heating and slow cooling, resulting in DNA 
15 heteroduplexes having different conformational forms, 
and thus different electrophoretic mobilities, unique 
for individual alleles of a gene (Bidwell et al, 1992), 

The use of the present method is particularly 
suitable in, for example, phenylketonuria geno typing, 
20 Phenylketonuria (PKU) is probably the most common 
disorder of amino acid metabolism and results from 
mutations at the phenylalanine hydroxylase (PAH) gene* 
The gene consists of 13 exons and encodes a MRNA of 2.8 
kilobases (Kwok et al, 1985; DlLella et al, 1986). To 
25 date, more than 50 mutations have been described at the 
PAH gene but there are wide variations in the frequency 
distribution of individual mutations between different 
populations (Scriver et al, 1992). In the south west 
of England, three mutations in exon 12, namely 
30 IVS12ntl, R408W and Y414C, account for just over 40% of 
PKU chromosomes (Tyfield et al, 1991). In vitro 
expression analyses reveal 100% loss of PAH activity 
associated with IVS12ntl and R408W mutations, and 50% 
reduction in activity for the Y414C mutation (Okano et 
35 al, 1991a, b). Within this population, other mutations 
found In the PAH gene are In exon 3 (I65T) and intron 
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10 ( IVS10nt546 ) , resulting in 74% and 100% loss of PAH 
activity respectively (John et al, 1992; Dworniczak et 
al, 1991). 

A significant proportion of early treated young 
5 adult PKU sufferers show clinical neurological 

deterioration (Thompson et al, 1991; Lou et al^ 1992), 
and thus it is important to research the possible link 
between specific PAH gene mutations and long term 
prognosis. Because of the large number of disease- 

10 associated mutations at this locus it is desirable to 
have available a mutation detection system which is 
rapld^ safe and reliable, and is able to screen samples 
for several mutations at one time. 

Several techniques of mutation detection have been 

15 described, including those that depend on 

conformational polymorphisms in amplified single 
stranded DNA (SSCP) (Orita et al,1989), chemical 
cleavage of mismatched DNA sequences in amplified DNA 
(Cotton et al, 1988), and those that depend on 

20 differences in elect rophoretic melting properties 

between amplified heteroduplexes and homoduplexes in 
denaturing or thermal gradient gels (DGGE and TGGE) 
(Fisher and Lerman, 1983). However, none of these 
techniques is without some limitation regarding safety, 

25 reliability or universal adaptability. 

In a method in accordance with the present 
invention for the diagnosis of a PKU, it has been found 
to be effective in many cases to analyse exon 12 of the 
PAH gene which contains at least five known mutation 

30 sites leading to full or partial inactivation of the 
PAH gene product. Thus the DNA construct should 
preferably have a nucleotide sequence substantially 
corresponding to an invariable region or regions of the 
nucleotide sequence of exon 12 of the gene, with at 

35 least one nucleotide substitution, deletion or 
insertion at or adjacent the known variable 
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nucleotides. Thus, for instance, the DNA construct may 
have such a deletion, substitution or insertion at or 
adjacent to one or more of the following four 
positions, defined by reference to the specific codons 
5 of the PAH exon 12 wild type gene: 

the 1st nucleotide of codon 408; 
the 2nd nucleotide of codon 413; 
the 2nd nucleotide of codon 414; and 
the 1st nucleotide of codon 418; 
10 and may also have such a deletion, substitution or 

insertion at or adjacent the position corresponding to 
the 1st nucleotide of the PAH intron 12 wild type gene. 
Preferably, deletions and substitutions are employed. 
The invention also provides a test kit, which kit 
15 comprises: 

(a) two oligonucleotide primers suitable for use 
in PCR and capable of annealing to complementary 
sequences at respective ends of a nucleotide sequence 
to be examined; 
20 (b) a synthetic nucleotide (preferably DNA) 

construct which construct is capable of forming 
duplexes with the nucleotide sequence under 
consideration, the sequence of the construct being such 
that duplexes of different molecular conformation are 
25 formed between the construct and the nucleotide 

sequence under examination dependent upon the presence 
of a polymorphism at a known variable nucleotide or 
sequence of nucleotides within the sequence under 
examination; and 
30 (c) a control DNA and/or control PCR 

amplification product. 

The synthetic nucleotide construct may contain one 
or more deliberate nucleotide substitution(s) and/or 
deletion{s) and/or insertion(s) which is/are (i) 
35 opposite to a known variable nucleotide or sequence of 
nucleotides within the sequence under examination 
and/or (11) contiguous with a nucleotide which is 
opposite to a known variable nucleotide or sequence of 
nucleotides within the sequence under examination; 
40 The primers may be labelled as above. The control 

PCR amplification product may also be labelled. The 
kit may further comprise one or more of the following: 
a heat- stable DNA polymerase; 
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DATP, DCTP, DGTP and DTTP; 

appropriate biological buffers and cof actors; 
a database comprising the relative molecular 
conformation and gel mobilities of DNA 
5 fragments generated by PCR amplification and 

heteroduplex formation between selected DNA 
samples and the construct. 
For a better understanding of the present 
invention, and to show how the same may be carried into 
10 effect, reference will now be made to the accompanying 
drawings, in which: 

Figure 1 shows a "consensus" sequence for the 
second exon of the DPB-1 gene and a 5'- flanking 
(introh) region and the corresponding sequence of a 
15 synthetic DNA construct for use in a method in 

accordance with this invention. The sequence of the 
synthetic DNA construct is SEQ ID NO: 1 in the attached 
sequence listing. 

Figure 2 shows a PCR fingerprint for HLA-DPBl 
20 alleles produced in accordance with the present 

invention using a construct having the sequence shown 
in Figure 1; 

Figure 3 shows the PCR fingerprints for HLA-DPBl 
homozygous typing cells using the method of the present 
25 invention and the construct having the sequence shown 
in Figure 1; 

Figure 4a shows the PCR fingerprints of HLA-DPBl 
heterozygotes, without use of a construct; 

Figure 4b shows corresponding PCR fingerprints to 
30 those shown in Figure 4a, but using the construct of 
Figure 1 in the PCR fingerprinting method; 

Figure 5 shows the nucleotide sequence of a 
portion of the normal human PAH gene, including exon 12 
and part of the 5' and 3* flanking introns, as well as 
35 5 genotypes including known PKU mutations and a DNA 
construct for use in identifying those mutations; 
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Figures 6a, 6b, 7 and 6 show the stained gels for 
a series of experiments on the of PAH gene exon 12; and 

Figures 9(a), 9(b) and (c) model the homoduplexed 
and synthetic construct-heteroduplexed nucleotides 
5 between codons 407 and 410 of exon 12 of the PAH gene 
using in the model the construct sequence which 
identifies the R408W mutation. 

EXAMPLE 1 

A DNA construct specific for the DPBl locus was 
10 synthesised from multiple synthetic oligonucleotide 
"longmers", each of about 100 nucleotides in length, 
using techniques known in the art. In this example, 
four longmers were employed having the sequences 
identified SEQ id NO: 2, SEQ ID NO: 3, SEQ id NO: 4 and 
15 SEQ ID NO: 5. The DNA sequence of the construct was 
chosen to overlap the start of the second exon of the 
DPBl gene, the nucleotide sequence of which is known 
from Marsh, S6E and Bodmer, JG. (1991) "HLA Class II 
nucleotide sequences, 1991"; Tissue Rntigens, 37: 181- 
20 189, and contained deliberate nucleotide substitutions 
and a deletion designed to be contiguous with known 
hypervariable nucleotides within the sequence of the 
second exon of the DPBl gene. The construct was 338 
base pairs in length. The construct sequence included 
25 29 mismatches and 5 unmatched bases between the 

construct and a "consensus" sequence for the second 
exon and 3' flanking intron of the DPBl gene, the 
consensus sequence being determined by using a computer 
alignment program, MicroGenie (Version 7.01), Beckman 
30 Instruments. The construct was cloned in an M13mpl8 
phage. 

The DPBl "consensus" sequence is the top line in 
Figure 1. This "consensus" sequence was determined 
using a computer alignment program and variable bases 
35 are identified by the letter N. This means that these 
particular bases in the DPBl second exon vary between 
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individuals and give rise to the various allelic forms 
of the gene. For simplicity, the 5 '-3' coding strand 
of the consensus sequence only is shovm. 

The bottom line of Figure 1 (also SEQ ID NO: 1) is 
5 a DNA construct designed to contain deliberate 

nucleotide substitutions and deletions contiguous with 
the known variable nucleotides within the nucleotide 
sequence (here the second exon of the DPBl gene) under 
examination. Again, the 5 '-3' coding stand is shown 
10 for simplicity. 

As will be noted, the construct is designed to 
have an identical sequence with the great majority of 
the non-variable nucleotides of the DPBl consensus 
sequence, in order that duplexes will form between the 
15 DPBl DNA fragments from the sample of genomic DNA under 
consideration and the construct. In addition, at the 
known variable nucleotides of the DPBl sequence, 
deliberate nucleotide substitutions (sometimes 
herelnbelow referred to as "identifiers" as they 
20 "identify" any mutation at this position) are created. 
Each of these deliberate substitutions Is marked by an 
asterisk. The substitutions are chosen such that, in 
different allelic forms of the gene, a different extent 
of mismatch will exist. In addition, adjacent the 
25 known variable nucleotides, it may be desirable to 
create a deliberate mismatching with a conserved 
nucleotide; these are underlined in the construct 
sequence and effectively amplify the adjacent 
mismatches. As a result, herelnbelow, these are 
30 occasionally referred to as "amplifiers". The DNA 

construct shown has a number of different mismatches at 
different variable nucleotides of the consensus 
sequence and, by careful choice of the mismatches, 
different numbers and extent of single stranded loops 
35 will be formed in the duplex between the nucleotide 
sequence under Investigation and the construct, for 
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different alleles. In addition, a deliberate deletion, 
for example the 5 base pair deletion in the construct 
shovm by the "A" in Figure 1 ensures that all duplexes 
fonned between the genomic DNA derived from the sample 
5 and the construct will migrate at a significantly 
different rate to the duplexes formed within the 
population of genomic DNA. This "shifts" the bands 
derived from duplexes including the construct away from 
bands derived from duplexes not including the construct 

10 in the PGR fingerprint produced in accordance with the 
present invention. 

The procedure to produce a PGR fingerprint for the 
DRBl loci was as follows. 

The DBF nucleotide sequences under investigation 

15 (second exon) were amplified in the polymerase chain 
reaction from genomic DNA. The PGR primers employed 
were those described by Begovich etal (Begovich AB, 
Bugawan TL, Nepora BS, Klitz W, Nepom GT and Erlich HA 
(1989), "A specific HLA-DPB allele is associated with 

20 pauciarticular Juvenile rheumatoid arthritis but not 

adult rheumatoid arthritis", Proc Natl Acad Sci USA 86:9489- 
9493) and which are identified as U619 (SEQ ID NO: 6) 
and UG21 (SEQ ID NO: 7), available from suppliers of 
custom oligonucleotides. The reaction mixture 



25 contained: 

Genomic DNA Q O.lpg/^il S.Ojil 

UG19 primer (5pM stock) lO.Ojil 

UG21 primer (5pM stock) lO.Ojil 

lOX PGR buffer (supplied) lO.Ojil 

30 dNTP mix. (lOmM .each DNTP) 2.0pl 

ddH20 62.5U1 



The resulting mixtures were combined^ vortexed and 
then centrifuged briefly. The genomic DNA was 
denatured by heating the tube at 94 *G for 5 min and 
35 then snap cooled on ice for 3 min and overlaid with 
75iil of paraffin oil. 
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PCR was initiated using a "hot- start" procedure, 
whereby tubes containing the reaction mixture without 
Tag polymerase were equilibrated at 72** C in a heat 
block, rag polymerase (2,5 units, O.Spl) was then 
5 added through the mineral oil, and the tube was 
vortexed again, recentrifuged briefly and samples 
immediately transferred to the thermal cycler, 
previously set at 72 "C. The genomic DNA samples were 
then subjected to 35 PCR cycles, as follows. 
10 primer extension: 72 *C, 30 sec 

denaturation: 94 "C, 30 sec 

primer annealing: eS'C, 60 sec 
The 72 *C primer extension time at the end of the final 
cycle was increased to 10 min. 
15 The DHA construct was amplified from the M13mpl8 

clone using the same primers as for the genomic DNA 
under study, the reaction mixture being as follows: 

DPB-UHG M13mpl8 DNA: 1/100 dilution from 



supplied stock 1.0]xl 

20 UG19 primer (SjiM stock) lO.Opl 

U621 primer (5^M stock) lO.Opl 

lOX PCR buffer (supplied) lO.Opl 

dNTP mix (lOmM each dNTP) 2.0vil 

ddH20 66.5iil 



25 The same PCR and thermal cycling conditions were used 
as above for the genomic DNA, but with only 25 cycles. 

Following the PCR of the construct, 5|il of PCR 
product was added to Ipl of 6X gel loading buffer and 
electrophoresed for 40 min at 70 volts in a 1.5% (w/v) 

30 agarose mlnigel in IX TAE running buffer containing 
0«5}ig/ml ethidium bromide (Sambrook J, Fritsch EF and 
Maniatis T (1989) Molecular cloning, A laboratory 
manual; Cold Spring Harbor Laboratory Press) (Sambrook 
et al 1989, Supra). The resulting bands were the 

35 examined over a UV transilluminator and compared with 
the relative intensities of sample and DPB-construct 
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PCRs and the concentrations adjusted to give the same 
intensity of fluorescence. Appropriate amounts (6- 
20pl) of both PGR products were added to a single 
microcentrifuge tube, and subjected to a further 3 PGR 
5 cycles as above. A 12\xl aliquot was analysed for 
heteroduplexes by polyacrylamide minigel 
electrophoresis as described for the Mini-Protean II 
gel electrophoresis system (Bio*Rad). In this 
technique, pour a 12% nondenaturing polyacrylamide 

10 (29:1) minigel, using IX TBE running buffer and then 
add 8-lOpl of PGR product to 2]xl 6X gel loading buffer 
containing bromophenol blue and xylene cyanole FF. 
Electrophorese at 200 volts for 90-95 min. The exact 
time will need to be determined experimentally: the 

15 homoduplex band should be allowed to migrate to within 
0.5cm of the end of the gel. Stain gels for 20 min in 
IX TBE containing 0. 5pg/ml ethidium bromide and examine 
using a UV transllluminator. 

Following this procedure PGR fingerprints for HLA- 

20 DPBl alleles were obtained (see Figure 2). 

For each HLA-DPBl allele listed (DPB1*0101 to 
2001), one of the two possible heteroduplexes formed 
with the DPB-construct is represented: single 
nucleotide mismatches are shown as boxes, matched 

25 regions as lines. For example in DPB1*0101, mismatches 
in complementarity between the coding strand of the 
DPB1*0101 (upper) sequence and the "noncoding** strand 
of the construct (lower) sequence are shown, and in 
each case the individual mismatched nucleotide on the 

30 DPB1*0101 sequence is specified (see key). Nucleotides 
are numbered from the 5 '-end of the DPB PGR primer UG19 
(SEQ ID NO: 6)« Black triangles represent deliberate 
deletions in the DPB-construct designed to increase gel 
retardation of the heteroduplexes and thereby to 

35 produce a clear spread of visible bands (see Figs. 3 
and 4b). The single DPB-construct employed 
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theoretically generates an unique heteroduplex with 
each DPBl allele. 

In addition, PGR fingerprints of HLA-DPBl 
homozygous typing cells (HTCs) conducted in the 
5 presence of the DPB-construct were obtained (see Figure 
3). In each case an unique PGR fingerprint is 
observed. Lane 11 shows an HLA-DP heterozygote : the 
pattern observed is in part a combination of the 
relevant HTG patterns (lanes 5 and 6). M, molecular 

10 weight marker (pUCBM21 Hpall plus Dral and Hlndlll 

digests: fragment sizes 1114^ 900, 692, 501, 489, 404, 
320, 242, 190, 147, 124, 110, 67, 37, 34, 34, 26 and 19 
base pairs). 

Finally, PGR fingerprints of HLA-DPBl 

15 heterozygotes were obtained. PCRs not conducted in the 
presence of the DPB-construct show (see Figure 4a) some 
rudimentary heteroduplexes formed in trans between 
alleles in some, but not all, combinations of alleles 
(lanes 3, 4, 6, and 7). The effect of conducting the 

20 PCRs as shown in (Figure 4a) in the presence of the 
construct is shown in Figure 4b. Unique PGR 
fingerprints are now revealed for all heterozygotes, 
with excellent resolution of heteroduplexes. Lanes 7 
and 8 show a leukaemic patient and potential donor, 

25 respectively, matched for HLA-A, B and DR-Dw but 

mismatched for DP as revealed by the PGR fingerprints. 
The patient (lane 7) is DPBl *0301, 0401 (compare with 
lane 3) and the potential donor (lane 8) is 
DPB1*0201,0401 (compare with lane 1). The mixed 

30 lymphocyte reaction (MLR) was positive in this case. 
M, molecular weight marker (see above). 

EXAMPLE 2 

This example illustrates the application of the 
present invention to the classification of 
35 phenylketonuria genotypes. 

A DNA construct (SEQ ID NO: 8) was synthesised to 
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identify five mutations (R408W, R413P, y414C, T418P and 
IVS12ntl) within exon 12 of the phenylalanine 
hydroxylase (PAH) gene, identifiers for four of the 
five mutations were incorporated within the construct 
5 and consisted of nucleotide substitutions or 

substitutions plus deletions. A specific identifier 
for the T418P mutation was not necessary. Figure 5 
shows the nucleotide sequence of a portion of the 
normal human PAH gene, including exon 12 and part of 
10 the 5' and 3' flanking introns. Sequences of the PKU 
point mutations and the DNA construct are shown aligned 
to the normal sequence: dashes indicate identity. 
Annealing locations for the PAH12L and PAH12R PCR 
primers (see below) within the PAH gene are underlined. 
15 The overlapping complementary regions of the two 

constituent longmers are denoted by «. The construct 
(which is the lower line of Figure 5 -PAH-UHG and which 
is SEQ ID NO: 8 in the attached sequence listing) was 
synthesised from two synthetic oligonucleotides 
20 ('•longmers"): PAH-LEFT #330 (sense strand), a 95mer 
(SEQ ID NO: 9), and PAH-RIGHT #335 (antisense strand), 
a 97mer (SEQ ID NO: 10), with a region of 3'- 
compleraentarity of 16 bases (Fig 5). Sequences of the 
longmers were: 
25 PAH-LEFT #330: 

5'-TGT (5GT TTT 6GT CTT AG6 AAC TTT GCT GCC ACA ATA 
CCT CAC CTT CTC AGC CG6 CTA CAT ACA CCC AAA GGA 
TTG AGG TCT TG6 ACA ATA CCC AG-3 ' ; 
and PAH-RIGHT #335: 
30 5'-AGT CTT CGA TTA CTG AGA AAC CGA GTG GCC TC6 TAA 

GGT GTA AAT TAA GCT GTT AAT GGA ATC AGC CfJi AAT 
CTT AAG CTG CTG GGT ATT GTC CAA G-3 ' . 
These were synthesized on an ABI 391A DNA 
synthesizer using 40 nmole polystyrene support columns 
35 (Applied Biosystems). Following synthesis, the two 
longmers were Joined to form a double stranded 
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construct as follows. Reaction mixtures were set up 
containing 2.5pl of a 5pM aqueous solution of each 
longmer, 5yl dNTP mix (dATP, dCTP, dGTP and dTTP each 
at a concentration of lOmM), Ipl of lOmg/ml bovine 
5 serum albumin^ lOpl Vent™ DNA polymerase reaction 
buffer (New England Biolabs), 58pl deionized double 
distilled water, 2 units Vent™ DNA polymerase (New 
England Blolabs), added at 72 'C to hot-start the 
reaction (D'Aguila et al, 1991), and O.Spl E. coli 

10 single stranded DNA binding protein (Bind Aid™, United 
States Biochemicals ) • Mixtures were subjected to 
heating at 94 'C for 1 min and cooling at 37 'C for 1 
min. The temperature was Increased to 72 *C and lOpl 
each of 5pM solutions of exon 12 PCR primers were 

15 added. The sequences of these primers were: 

left (PAH12L), 5'-TGT GGT TTT GGT CTT AGG AA-3 ' 
(SEQ ID NO: 11); and 

right (PAH12R), 5'-AGT CTT CGA TTA CTG AGA AA-3' 
(SEQ ID NO: 12). 

20 Thermal cycling conditions were as for genomic -DNA 

amplification (see below). The products of six 
replicate reactions were pooled and electrophoresed on 
two 8% w/v nondenaturing polyacrylamlde gels (Protogel, 
National Diagnostics) as previously described (Bidwell 

25 and Hui, 1990). Gel slices containing the double 
stranded construct band were excised and the DNA 
extracted by Isotachophoresis (Ofverstedt et al, 1984), 
precipitated with ethanol, washed in 70% ethanol, and 
redissolved in 50^1 sterile double distilled water. 

30 Ten microlitres of a 1:10,000 serial dilution was 
amplified for use in the heteroduplex analyses. 

Genomic DNA was isolated from whole blood 
according to standard protocols, or alternatively from 
dried blood spots used in neonatal screening tests, as 

35 follows: a 2mm^ area of blood spot was excised from the 
paper, added to a 0.5ml microcentrifuge tube containing 
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25\xl of a 1:10 dilution of AmpliTaq™ reaction buffer 
(Perkin Elmer Cetus). Samples were heated at 96 for 
15 mln, cooled, diluted with a further 25yl of IX 
ArapliTaq™ reaction buffer, mixed and centrifuged at 
5 13000g for 15 min. Supernatants were used directly in 
the PGR. Separate UHG and genomic DNA PGR 
amplifications were performed in lOOpl reaction 
mixtures containing lOjil heat-denatured diluted UHG or 
genomic DNA isolated from whole blood (lOOng) 

10 respectively, lOpl each of 5|iM solutions of PAH12L and 
PAH12R primers (details above), 2]xl dNTP mix (dATP, 
dCTP, dGTP and dTTP (Boehrlnger Mannheim) each at a 
concentration of lOmM), 58]xl deionlzed double distilled 
water^ lOul AmpliTaq™ reaction buffer and 2 units 

15 AmpliTaq™ DNA polymerase (Perkin Elmer Cetus). The 
enzyme was added at 72 'C to hot- start the reaction 
(D'Aquila et al, 1991). For PGR amplifications of DNA 
obtained from dried blood spots, 30pl of supernatant 
was used In the reaction mixtures, and the volumes of 

20 AmpliTaq™ reaction buffer and water were reduced to 
7pl and 40.6pl respectively. The reaction mixtures 
were subjected to 35 rounds of thermal cycling at 94 'C 
for 1 min, 48 'C for 1 min and 72 'C for 1 min. 

Fifteen microlltres of PCR-ampltfied genomic DNA 

25 and of PCR-amplified construct were mixed and subjected 
to 3 rounds of thermal cycling at 94'C for 1 min, 65 'C 
for 2.5 min and 72 'C for 2.5 min (Note: 1 amplification 
of the construct was therefore sufficient for 6 
reactions with POt-amplifled genomic DNA). The entire 

30 mixture was subjected to electrophoresis for 90 min at 
200V on 12% w/v nondenaturing polyacrylamide minlgels 
(Protogel, National Diagnostics) and heteroduplexes 
were visualised by staining with ethidium bromide as 
previously described (Bidwell and Hui, 1990). 
35 Figure 6a, 6b, 7 and 8 show the stained gels for a 

series of experiments on the of PAH gene exon 12 
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mutations. 

In Figure 6(a), Lanes 1 and 12 show molecular 
weight markers (M, marker VIII, Boehringer Mannheim). 
The leading double bands In all other lanes are the 
5 homoduplexes of the synthetic DNA construct (leading 
band, 176bp) and PAH gene exon 12 (trailing band, 
185bp) PCR products. Bands of apparent molecular 
weight 400-500bp are different conformational forms of 
DNA heteroduplexes (Figs 6b and 8). Lanes 2, 11, 13 

10 and 16 show the heteroduplexes observed in normal 
individuals (controls). Other lanes show the 
variations in heteroduplexes observed in PAH gene exon 
12 mutations, denoted beneath each lane: hom, 
individual homozygous for mutation; . het, individual 

15 heterozygous for an exon 12 mutation (non-exon 12 PAH 
gene mutation present on other chromosome). Lanes 6, 8 
and 14 show Individuals with different PAH gene exon 12 
mutations on each chromosome. In all cases, 
characteristic heteroduplexes (also see Figure 6b) are 

20 observed for each mutation, and in addition the 
homozygosity or heterozygosity status is readily 
determined. 

Figure 6(b) is a schematic representation of DNA 
heteroduplex banding patterns obtained for each PAH 

25 gene exon 12 mutation. For each mutation, banding 
patterns are shown for homozygous (hom) and 
heterozygous (het) states. The mobilities of the 
bands (apparent molecular weights) are: a, 446bp; b, 
433bp; c, 416bp; d, 395bp; e, 374bp; f, 359bp; g, 

30 354bp. Bands c plus g are observed in normal PAH genes 
and In genes with non-exon 12 mutations. 

The results shown in Figure 8 were obtained in the 
same way as Figures 7a and 7b and show the 
classification of mutations in PKU individuals using 

35 DNA Isolated from dried blood spots. C, normal 

control; M, molecular weight markers (for details see 
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Fig 2 legend); lane 1, individual with R408W and 
IVS12nt:l mutations on opposite alleles; lanes 2, 3 5 
and 6, IVS12ntl heterozygous individuals; lane 4, 
IVS12ntl homozygous individual. 

Figures 6a and 6b and 8 show that the synthetic 
construct was able to generate unique and 
discriminatory DNA heteroduplexes for all five known 
exon 12 mutations within DNA from PKU individuals, both 
in the homozygous and heterozygous states. Moreover, 
the visual quality of these results compared favourably 
with those obtained with DNA isolated from fresh blood 
samples, and permitted unambiguous assignment of exon 
12 genotypes. 

The reliability and reproducibility of the test 
15 was examined by screening two sets of genomic DNA 

samples collected from PKU individuals in whom exon 12 
mutations were established by other methods, either 
prospectively or retrospectively. 

Figures 7a and 7b show the results of a blind 
trial of concordance between the DNA analysis method of 
the present invention and the results of other DNA 
typing analyses. Figure 7a represents the results 
using the first set of genomic DNA whilst Figure 7b 
represents the results using the second set of genomic 
25 DNA constituting untreated, PKU Individuals with 

reduced intelligence from a Scottish population, in 
both Figures 7a and 7b, lane M are molecular weight 
markers (for details see Figure 6 legend) and lane C is 
a normal control. In Figure 7a, the lanes were as 
30 follows: lane 1, R408W heterozygous individual; lanes 

2, 8, 9 and 11, IVS12ntl heterozygous individuals; lane 

3, individual with Y414C and IVS12ntl mutations on 
opposite alleles; lane 4, y414C heterozygous 
individual; lane 5, IVS12ntl homozygote; lane 7, 

35 individual with R408W and IVS12ntl mutations on 
opposite alleles. Lanes 6, 10 and 12 are PKU 
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indivlduals with mutations outside exon 12. In Figure 
7b, the lanes are as follows: lanes 1-7 and 9, 
IVS12ntl heterozygous individuals; lane 8, PKU 
individual with mutation outside exon 12. 
5 For the first set of genomic DNA (Figure 7a) there was 
complete concordance between results of the PAH-UHG 
tests and prospectively performed analyses using allele 
specific oligonucleotide (ASO) probes or DNA 
sequencing. 

10 In the second set of genomic DNA, constituting 

untreated, PKU individuals with reduced intelligence 
from a Scottish population, 8 out of 9 samples were 
identified as heterozygous for the IVS12ntl mutation 
(Figure 3b); these results were confirmed by ASO 

15 analysis (data not shown), and are not unexpected for 
this population (Sullivan et al, 1989). 

A practical consideration of the potential 
application of this technology is in the ability to 
rapidly genotype infants who are positively diagnosed 

20 in the neonatal screen for PKU. We addressed this by 
analysing genomic DNA prepared from a 2nm^ area of 
dried blood spots normally used in the neonatal 
screening programme. These samples had been obtained 
from affected individuals who regularly send dried 

25 blood spots to the screening laboratory for monitoring 
blood phenylalanine concentrations. The samples were 
between 1 to 4 months old and had been left at ambient 
temperatures during that time. Samples were analysed 
from PKU individuals whose genotype had been 

30 established previously using ASO probes or DNA 

sequencing. In all cases, the visual quality of these 
results (Figure 8) compared favourably with those 
obtained with DNA isolated from fresh blood samples, 
and permitted unambiguous assignment of exon 12 

35 genotypes. 

This Example shows the technique of the present 
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invention to be a highly sensitive, rapid and non- 
radioactive screening and classification system for PKU 
in which five mutations in 'exon 12 of the PAH gene can 
be identified using a single DNA construct and one PCR- 
5 based test. The tests are technically simple to 

perform and suitable for the routine screening of large 
numbers of individuals. Results were obtained within 
24 hours of receipt of blood samples, and were of equal 
visual quality when performed on whole blood (Figures 6 
10 and 7) or dried blood spots (Figure 8). Evidence from 
this and parallel studies being conducted for other 
genetic diseases, suggests that previously undescrlbed 
mutations within sequences under study may be 
identified initially by an unmodified DNA construct. 
15 Following DNA sequence analysis to identify the new 
mutation, specific modifications to the construct, 
derived from molecular modelling simulations, can 
optimise the identification of the new mutation in 
subsequent tests. The conformational and surface 
20 charge modifications induced in heteroduplexed DNA by 
constructs may be modelled. An example of the 
homoduplexed and synthetic construct-heteroduplexed 
nucleotides between codons 407 and 410 of exon 12 of 
the PAH gene using in the model the construct sequence 
25 which identifies the R408W mutation is shown in Figure 
9. Conformational differences include altered 
interatomic distances and angles, increased diameter of 
the helix (compare diameters in (b)), and changes in 
net surface charge. The changes in electrophoretic 
mobilities as a consequence of such modifications may 
be predicted to a large extent from such theoretical 
models prior to actual construction of specific 
synthetic constructs. This Example Illustrates the wide 
potential of synthetic DNA construct-based DNA 
heteroduplex technology. The technique appears ideally 
suited to the genotyping of multi-allelic genes where 



30 



35 



wo 93/19201 



-33- 



PCT/GB93/00616 



mutations are clustered within a reasonable distance 
and are flanked by conserved sequences suitable for PGR 
priming. Genotyplng mutations within separate exons In 
genomic DNA may be facilitated using exon-^ specific 
5 constructs, or potentially by a single construct for 
genotyplng cDNA derived from mRNA by reverse 
transcription PGR (Myers and Gelfand, 1991) 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: The Uiiiversity of Bristol 

(B) STREET: Senate Houses Tyndall Road 

(C) CITY: Bristol 

10 (E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): BS8 ITH 



(ii) TITLE OF INVENTION: NUCLEIC ACID ANALYSIS 
15 (iii) NUMBER OF SEQUENCES: 12 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) CC»4PUTER: IBM PC compatible 

20 (C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version 
#1.25 (EPO) 



25 (2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 base pairs 

(B) TYPE: nucleic acid 
30 CC) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GCTGCAGGAG AGTGGCGCXT CCGCTCATGT CCGCCCCCTC CCCGCAGAGA 
ATTCAGTGTA 60 

40 CCACTTACG6 CAGGAAT6CT ACGCGTTTAA T6G6ACACAG CGCTTCCTGG 
AGAGATACAT 120 

CTACAACCGG GAGGAGTTCG TGCGCTTCGA CAGCGACGTG GGGGAGTTCC 
GGGC66TGAC 180 

45 

G6AGCTGGGG CGGCCGCATG AGGACAACTG GAACAGCCAG AAGGCACTCC 
T6GAAGCGG6 240 

CAGTGCCGGA CAGG6AATGC AGACACAACT ACGAGCTGGA CGATGCCGTG 
50 ACCCTGCAGC 300 

GCCGAGGTGA GT6A66GCTT TGGGCCGGAT CCG 
333 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GCTGCAGGAG AGTGGCGCCT CCGCTCATGT CCGCCCCCTC CCCGCA6AGA 
ATTCAGTGTA 60 

15 

CCACTTACGG CAG6AATGCT ACGCGTTTAA 
90 

(2) INFORMATION FOR SEQ ID NO: 3: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

30 

CCGGAACTCC CCCACGTCGC TGTCGAAGCG CACGAACTCC TCCCGGTTGT 
AGATGTATCT 60 

CTCCAGGAAG CGCTGTGTCC CATTAAACGC GT 
35 92 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGAGTTCCGG (XGGTGhCGG kGCTGGGGCG GCCGCATGAG GACAACTGGA 
50 ACAGCCAGAA 60 

GGCACTCCTG GAAGCG(KX:A GTGCCGGACA G 
91 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



15 



20 



30 



35 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CGGATCCGGC CCAAAGCCCT CACTCACCTC GGCGCTGCAG GGTCACGGCA 
TCGTCCAGCT 60 

CGTAGTT6T6 TCTGCATTCC CTGTCCGGCA 
90 

(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GCTGCAGGAG AGTGGCGCCT CCGCTCAT 
28 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGGATCCGGC CCAAAGCCCT CACTC 
25 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

TGTGGTTTTG GTCTTAG6AA CTTTGCTGCC ACAATACCTC TTACCTTCTC 
AGCCGGCTGC 60 

10 ATACACCCAA AGGATTGAG6 TCTTGGACAA TACCCAGCAG CTTAAGATTT 
TGGCTGATTC 120 

CATTAACAGA CCTTAATTTA CACCTTACGA GGCCACTCGG TTTCTCA6TA 
ATCGAA6ACT 180 

15 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 9 5 "base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



TGTGGTTTTG 6TCTTAGGAA CTTTGCTGCC ACAATACCTC ACCTTCTCAG 
30 CCGGCTACAT 60 

ACACCCAAAG 6ATTGAGGTC TTGGACAATA CCCAG 
95 

35 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

AGTCTTCGAT TACTGAGAAA CCGAGTGGCC TCGTAAG6TG TAAATTAAGC 
T6TTAAT6GA 60 

50 ATCAGCCAAA ATCTTAAGCT GCTGGGTATT GTCCAAG 

97 

(2) INFORMATION FOR SEQ ID NO: 11: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

10 

TGTGGTTTTG GTCTTAGGAA 
20 

(2) INFORMATION FOR SEQ ID NO: 12: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

25 

AGTCTTCGAT TACTGAGAAA 
20 
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CLIklMS: 

1. A method for examining polymorphisms In a 
nucleotide sequence of a first sample of nucleic acid 
comprising the steps of: 

5 (a) forming a population of nucleic acid 

fragments bearing the said nucleotide sequence; 

- (b) combining the population of nucleic acid 
fragments with a population of a synthetic nucleotide 
construct which construct Is capable of forming 

10 duplexes with the nucleic acid fragments, the sequence 
of the construct being such that duplexes of different 
molecular conformation are formed between the construct 
and the nucleic acid fragments dependent upon the 
presence of a polymorphism at a known variable 

15 nucleotide or sequence of nucleotides within the 
sequence of the sample under examination; 

(c) permitting duplex formation within the 
combined populations; and 

(d) separating the duplexes formed In step (c). 
20 2. combining the population of nucleic acid 

fragments with a population of a synthetic nucleotide 
construct which construct is capable of forming 
duplexes with the nucleic acid fragments but which 
contains one or more deliberate nucleotide 

25 substitution(s) and/or deletlon(s} and/or Insertlon(s) 
which is/are (1) opposite to a known variable 
nucleotide or sequence of nucleotides within the 
sequence imder examination and/or (11) contiguous with 
a nucleotide which is opposite to a known variable 

30 nucleotide or sequence of nucleotides within the 
sequence under examination; 

2. A method according to claim 1, wherein the 
synthetic construct contains one or more deliberate 
nucleotide substitutlon(s) and/or deletion(s) and/or 

35 insertlon(s) which is/are (1) opposite to a known 

variable nucleotide or sequence of nucleotides within 
the sequence under examination and/or (11) contiguous 
with a nucleotide which is opposite to a known variable 
nucleotide or sequence of nucleotides within the 

40 sequence under examination. 

,3. A method according to claim 2, wherein the 
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nucleotlde sequence of the synthetic construct at least 
contains deliberate nucleotide 8ubstitution(s) and/or 
deletion(s} and/or Insertlon(s) contiguous with the 
nucleotide which is opposite the knovm variable 
5 nucleotide or sequence of nucleotides within the 
sequence under examination. 

4. A method according to claim 2 or 3, wherein 
the nucleotide sequence of the synthetic construct 
contains deliberate nucleotide substitution(s) and/or 

10 deletion(s) and/or insertion(s) opposite the known 
variable nucleotide or sequence of nucleotides. 

5. A method according to claim 4, wherein the 
synthetic construct Includes nucleotide substltutlon(s) 
and deletlon(s) both opposite and contiguous with the 

15 mutation* 

6. A method according to any one of claims 2 to 

5, wherein the nucleotide sequence of the synthetic 
construct contains between one and ten contiguous 
substitutions/deletions adjacent the position of the 

20 polymorphism in the nucleotide sequence under 
investigation 

7. A method according to any one of claims 2 to 

6, wherein the synthetic construct includes deliberate 
nucleotide substltution( s ) and/or deletion(s) and/or 

25 insertionCs) corresponding to at least two known 
polymorphisms in the nucleotide sequence 

8. A method according to any preceding claim^ 
wherein the construct includes a deliberate deletion or 
a series of deletions, at a position away from 

30 polymorphic sites in the nucleotide sequence under 
investigation . 

9. A method according to any preceding claim, 
wherein the nucleic acid under examination is DNA. 

10. A method accoirding to any preceding claim, 
35 wherein the synthetic nucleotide construct is a 

synthetic DNA. 

11. A method according to any preceding claim, 
wherein the population of DNA fragments bearing the 
nucleotide sequence is formed from a DNA source by a 

40 polymerase chain reaction. 

12. A method according to any preceding claim in 
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which the synthetic construct Is a DNA construct, 
wherein the population of synthetic DNA construct is 
formed by a polymerase chain reaction. 

13. A method according to any preceding claim for 
5 matching individuals in respect of one or more of the 

functional proteins expressed by the HLA (hximan 
leucocyte antigen) class 1 and II genes, wherein the 
nucleotide sequence under examination is a sequence, or 
part of a sequence, of a HLA Class I or II gene 
10 containing a known polymorphic site or sites. 

14. A method according to any one of claims 1 to 
12, for determining an individual's susceptibility to a 
genetic disease, wherein the nucleotide sequence under 
examination is a sequence^ or part of a sequence, of a 

15 gene containing a known polymorphic site or sites 
associated with a genetic defect. 

15. A method according to claim 14, wherein the 
genetic disease Is phenylketonuria genotyping and the 
gene containing a known polymorphic site or sites is 

20 the phenylalanine hydroxylase (PAH) gene. 

16. A test kit, which kit comprises: 

(a) two oligonucleotide primers suitable for use 
in PGR and capable of annealing to complement aiy 
sequences at respective ends of a nucleotide sequence 

25 to be examined; 

(b) a synthetic nucleotide (preferably DNA) 
construct which construct is capable of forming 
duplexes with the nucleotide sequence under 
consideration, the sequence of the construct being such 

30 that duplexes of different molecular conformation are 
formed between the construct and the nucleotide 
sequence under examination dependent upon the presence 
of a polymorphism at a known variable nucleotide or 
sequence of nucleotides within the sequence under 

35 examination; and 

(c) a control DNA and/or control PGR 
amplification product. 
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